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FILTERING  AND  CONTROL  PROBLEMS  FOR  PARTIAL 
DIFFERENTIAL  EQUATIONS 

A.  V.  Balakrishnan t 

1 . Introduction 

Many  problems  of  filtering  and  control  involving  partial  differential 
equations  can  be  described  abstractly  in  a manner  similar  to  the  ordinary 
differential  equation  case  by  invoking  the  theory  of  semigroups  of  operators 

[1].  v We  have  thus  a 'state  equation'  of  the  form 

\ 

x[ty  = Ax(t)  + Bu(t)  + Fn(t) 

where  A is  the  infinitesimal  generator  of  a strongly  continuous  semigroup, 
u(.)  is  the  'input',  n(.)  is  the  noise  and  B and  F are  bounded  linear 
operators.  The  'observation'  y(t)  has  the  form 

y(t)  = Cx(t)  + Gn(t) 

where  C and  G are  bounded  linear  operators.  An  optimal  control  and  filtering 
theory  for  such  a system  has  been  developed  in  [1].  However,  the  extension 
to  the  case  where  the  operators  B,  F and  C are  not  bounded  turns  out  to  be 
quite  significant.  In  fact,  they  need  to  be  not  only  unbounded;  but  also 
uncloseable.  We  shall  treat  two  typical  cases  illustrating  such  extensions. 
The  first  problem  (B  unbounded)  occurs  when  the  control  is  on  the  boundary 
(of  which  there  is  no  finite  dimensional  analogue).  The  second  problem,  of 
C unbounded,  uncloseable,  occurs  whenever  we  wish  to  consider  "point-wise" 
observations  (in  the  domain  or  on  the  boundary). 

We  begin  with  the  boundary  control  problem. 
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2.  Boundary  Control 

In  this  section  we  shall  consider  the  linear  (deterministic)+  quadratic 
regulator  problem  for  a class  of  diffusion  equations  when  the  control  is  on  the 
boundary.  Lions  [3]  has  treated  such  a problem  but  he  requires  that  the  control 
be  in  H^2  on  the  boundary  (in  particular  requiring  c“  boundaries).  This 
restriction  seems  somewhat  artificial  and  in  any  case  here  we  require  only  that 
the  controls  be  in  L2  on  the  boundary.  Also  we  exploit  semigroup  theory.  The 
main  tool  is  the  construction  of  a generalized  solution  for  boundary  inputs 
(see  [1]  for  an  elementary  exposition). 

Let®  denote. a bounded  domain  with  boundary  r in  real  Euclidean  space 
Rn.  Points  in  the  space  will  be  denoted  by  £,  with  components  Let  t 
denote  a second  order  strongly  elliptic  operator: 

mm  .2f  m 

xf  = Z Z a ..(C)  wrAy-  + £ a.(c)  in  S (2.1) 

1 1 1J  35i  3Cj  1 1 

where  the  coefficients  are  continuous  in  the  closure  of  S. 

We  consider  the  control  problem  for  the  equation: 

= xf  0 < t,  C in  S » 

f(t,.)|r  = u(t,.)  (2.2) 

f(0,£)  given  ] 

v/here  u(t,.)  is  in  L2(r)  for  each  t.  Before  we  specify  the  cost  function, 
we  shall  develop  the  semigroup  theoretic  approach  to  the  solution  of  (2.2). 

For  this  purpose  we  first  consider  the  special  case  of  controls  u(t,.)  which 
are  strongly  continuously  differentiable  in  0 < t < °°. 

+For  a treatment  of  stochastic  boundary  control  see  [2], 
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Let  A denote  the  'zero  boundary  value1  restriction  of  t,  more  specifi- 
cally, let  A denote  the  smallest  closed  extension  in  L2(2>)  of  t restricted 
to  C“(S>).  Assume  that  x is  strongly  elliptic  so  that  the  quadratic  form: 


mm  m 

l l a..  (K)  X.Xi  > 3 l x/,  3 > 0,  for  all  g in  . 

1 1 J J 1 


Then  A generates  a strongly  continuous  semigroup  over  L2(S>),  analytic  in  a 
sector  in  the  right-half  plane.  Denote  the  semigroup  S(t).  We  assume  next 
that  the  boundary  r is  such  that  the  Dirichlet  problem 

*N 


where  g is  in  L2(r)  has  a unique  solution  given  by 


f = Dg 

where,  furthermore,  D is  a linear  bounded  transformation  mapping  L2(r)  into 

L2(fl). 

Then  following  a technique  due  to  Fattorini  [4]  we  can  express  the 
solution  of  [2.2]  in  abstract  form  as  [see  [1]]: 

x(t)  = S(t) (x(o)-Du(o) )+ J"  S(t-s)  Du(s)ds  (2.3) 

o 

where  x(o)  is  the  initial  function  (assumed  to  be  in  L2(2))).  For  each  t, 
x(t)  describes  the  solution  as  an  element  of  L2(D)  and  is  strongly  continu- 
ous in  t.  For  x(o)  in  the  domain  of  A,  the  solution  satisfies  (2.2)  and 
moreover 
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| |x(t)  - x(o) 1 1 - 0 , as  t - 0+ 


and  of  course  x( t)  'assumes'  the  boundary  value  u(t).  See  [1]  for  the 
uniqueness  of  the  solution  (2.3). 

While  (2.3)  is  an  acceptable  solution,  it  has  the  disadvantage  that 
the  derivative  of  u(t)  appears  and  we  want  to  avoid  this.  It  is  here  that 
we  introduce  our  notion  of  a 'generalized'  solution,  sacrificing  the 
"pointwise"  interpretation  of  (2.2). 

The  Generalized  Solution  of  the  Boundary  Input  Problem 
We  exploit  now  the  fact  that  the  semigroup  S(t)  is  analytic.  From 
this  it  follows  [see  [9  for  a proof]  that  for  any  u(.)  in 

Wb  = L2[[0,T];  L2(r)] 

we  have  that 

f S(t-cr)  Du(a)da 
*/o 

belongs  to  the  domain  of  A a.e.  in  [0,T]  and  that  a.e: 

/t  /*  t 

S(t-a)  Du(a)da  = / AS(t-cr)  Du(a)da  = g(t)  0 < t < T 

o *o 

where  g(t)  belongs  to 

W$  = L2[0,T;  L2(2>)] 

Moreover 


a.e 


(2.4) 


t 

Lu  = g;  g(t)  = AS(t-a)  Du(cr)da 
tr 

defines  a linear  bounded  transformation  mapping  Wb  into  Ws.  Integrating 
by  parts  in  (2.3)  and  exploiting  the  fact  that  L is  bounded,  we  can  obtain 
the  'generalized  solution*  which  is  now  valid  for  any  u(.)  in  W^,  as: 

/t 

AS(t-s)  Du(s)ds  a.e  0 < t (2.5) 

u 

and  here  the  a.e.  qualification  is  crucial.  For  the  details,  we  refer  once 
again  to  [1],  Note  that  A commutes  with  S(t)  on  the  domain  of  A and  hence 
it  is  as^  jf  we  have 

AS(t-s)  Du(s)  = S(t-s)  Bu(s) 

where  B roughly  speaking  is  "AD",  although  of  course,  A is  simply  not 
defined  on  the  range  of  D! 

We  can  now  specify  the  optimization  problem.  Given  Q,  linear  bounded, 
mapping  (2>)  into  another  Hilbert  space,  it  is  required  to  minimise: 

rT  rT 

q(u)  = / [Qx(t),  Qx(t)]dt  + / [u(t),u(t)]dt  (2.6) 

*/o  J o 

for  u( . ) in  Wfa,  with  x(.)  given  by  (2.5).  Note  that  we  can  recast  (2.5)  in 
the  form: 

x(t)  = S(t)x(0)  - A y(t)  a.e  \ 

y(t)  = A y(t)  + Du(t)  a.e  > (2.7) 

y(0)  - 0 ) 
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As  the  first  step  towards  the  characterization  of  the  optimal  control, 
let  us  calculate  first  L*,  the  adjoint  of  L since  it  will  play  a key  role. 

We  have 

L*x  = u;  u(t)  =/*T-  D*  (AS(s-t))*  x(s)ds  a.e.  (2.8) 


Now  for  t > 0 

D*  ( AS ( t ) ) * = D*  A*  S(t)* 

Let  us  set 

' ** 

C =D*  A* 

Then  C is  clearly  defined  on  the  domain  of  A*  (and  hence  also  in  the  range 
of  S(t)*  for  t > 0)  but  C is  not  closeable.  In  fact  in  the  simple  case 
where  t is  actually  the  Laplacian,  we  can  see  (by  Green's  theorem)  that  C 
reduces  to  the  normal  derivative  on  the  boundary  r and  hence  is  clearly  not 
closeable.  Indeed  if  C were  closeable,  its  adjoint  would  be  AD  which  is  of 
course  not  defined  except  on  the  zero  element. 

Of  course 

C S(t)* 

is  linear  bounded  for  each  t > 0 and 

f CS(s-t)*  Q*Q  x(s)d$  a.e  0 < t < T 
Jt 

defines  an  element  of  W.  for  x(.)  in  W . 

b s 


By  the  same  kind  of  theory  as  in  the  finite  dimensional  case,  we  can 
establish  the  existence  of  unique  optimal  control  uQ( . ) given  by  (see  [1] 
for  instance): 

uQ  = - (QL)*  (Q  Lu0  + Qu)  = -(QL)*  Q xq(-) 
where  u is  the  element  in  W$  defined  by 
S(t)  x(0)  0 < t < T 

We  can  express  uQ( . ) as: 

V v 

uQ(t)  = - C z(t) 
where  z(t)  is  the  solution  of 

z(t)  + A*  z(t)  = -Q*Q  xo(t)  . ; z(T)  = 0...  (2.9) 

By  analogy  with  the  finite  dimensional  case,  we  expect  that  we  must  be 
abie  to  express  z(t)  as 

z(t)  = p(t)  xQ(t) 

where  P(t)  is  a linear  bounded  self-adjoint  transformation  mapping  into 

itself,  satisfying,  for  x,y  in  domain  of  A,  a.e.  in  t: 

[P(t)x,y]  + [P(t)Ay,x]  + [P(t)x,Ay]  ^ 

- [CP(t)x,CP(t)y]  + (Qx,Qy]  = 0 V (2.10) 

P(T)  = 0 ) 


Note  this  implies  in  particular  that  for  any  x in  l^t®). 


P(t)x  e Domain  of  C,  a.e.  0 < t < T 
for  which  it  is  enough  if  for  x e (A), 

P(t)  e Domain  of  A*,  a.e.  0 < t < T 

Assume  now  that  we  have  found  such  a P(t).  Then  we  can  show  that  P(t)  xQ(t) 
satisfies  (2.9).  We  have 

[P(t)  x0(t),y] 

= [s(t)  x(0)  + Ay0(t),  P(t)y] 

where 

y0(t)  = Ay0(t)  + D u0(t);  y(0)  = 0 
The  main  calculation  is  that 

[P(t)  x0(t),y]  = [P(t)  AS ( t)  x(0) ,y]  - [A  y(t)  + D u(t),  A*P(t)y] 

= [x0(t).  A*  P(t)y]  - Cu(t).  CP(t)y] 

= [xQ(t),  A*  P(t)y]  + CCP(t)  x0(t),  CP(t)y] 

A little  arithmetic  shows  that 

P(t)  xQ(t)  (2.11) 

does  satisfy  (2.9),  which  of  course  has  a unique  solution.  We  also  have  that 
q(uQ)  = [P(0)  x (0),  x (0)]. 

The  main  problem  is  then  to  show  existence  of  solution  of  (the  Riccati 
equation)  (2.10).  The  main  difficulty  arises  from  the  fact  that  C is  unbounded, 
uncloseable,  and  is  not  treated  in  current  works  on  the  operator  Riccati  equation 
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[4,5],  Here  we  shall  exploit  the  dual  filtering  problem  for  this  purpose, 
which  is  also  of  interest  on  its  own. 

3.  A Filtering  Problem 

Let  us  consider  the  stochastic  system: 

x(t,u))  = A*x(t,oi)  + F w(t),  a.e.  0 < t < T;  x(o)  = 0 (3.1) 

where  A is  the  same  generator  as  in  section  2,  F is  linear  bounded,  mapping 

H$  x Hb  into  Hb,  such  that  FF*  = Q*Q  and, 

U-|(t) 

w(t)  = 

u2(t) 

where  <^(*)»  a)2(*)  are  independent  standard  white  Gaussian  noises  in  W$  and 
Wb  respectively,  and  hence  u(*)  defines  similar  white  noise  in  x Wb<  We  have 
of  course  that 

t 

x(t,w)  = J S*(t-o)  Faj(a)da 
o 

and  for  each  u>,x(t,u>)  is  continuous  in  t. 

The  observation  is 

y(t,w)  = C x (t,w)+  w(t)  a.e.  0 < t < T (3.2) 

where 


FG*  = 0 

GG*  = Identity 

We  have  'distributed'  state  noise,  and  the  'observation'  noise  is  on  the 
boundary.  The  filtering  problem  for  such  a system  when  C is  bounded  (so  that 
Cx(t,u)  is  defined  for  every  t)  is  treated  in  [1],  Here  we  consider  a case 
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where  C is  unbounded,  uncloseable;  more  specifically,  we  take  C as  defined  in 
section  2.  Then  as  we  have  seen,  for  each  oj (•), 


x(t,a))  e Domain  of  C a.e.  0 < t < T. 

so  that  (3.2)  is  well  defined  for  each  oj.  Moreover 

t 

Cx(t,u)  = D*  A*  x(t,w)  = D*  f A*  S*(t-o)  F u(o)da 
t 0 

= f CS*(t-a)  F oj(a)da  a.e.  0 < t < T 

"N  o 

and  hence  Cx(*,to)  is  an  element  of  Wb  and 

thus  (3.2)  can  be  looked  upon  as  a mapping  into  Wb  for  each  u(*)  in 
Ws  x Wb  (Cartesian  product). 

Let  us  use  the  notation: 

Wb(t)  - L2[[0,t];  L2(D] 

W$(t)  = L2[[0,t];  L2(S>)] 

Then 


y(s,u)  0 < s < t 

is  in  W^( t)  and  is  a second  order  random  variable  therein, 

denoting  it  by  n(t,u>),  we  have: 

ECn(t,u)  n(t,u)*3  = i + (cl( t) ) (CL(t))* 

where 

s 

L(t)f  = g;  g(s)  = f S*(s -a)  F f(a)da  0 < s < t 
0 s 

CL(t)f  = g;  g(s)  = / CS*<s  -a)  F f(a)da  a.e.  0 < s < t 


In  fact 


and  L(t)  and  CL(t)  define  linear  bounded  transformations  on  W (t)  ® WL(t)  into 

s b 

Wg(t)  and  Wfa(t)  respectively.  Hence  we  know  that 
x(t,u>)  = E[  x(t,u>)  J r)(t,to)] 


= M(t)  (CL(t))*  [I  + (CL(t)  (CL(t))*)'1  rj(t,«u) 


(3.3) 


where 


*0 

M(t)f  = x;  x = / S*(t-a)F  f(a)da. 


Since 


C x(t,to) 


is  only  defined  a.e.  in  t,  we  cannot  talk  about  E(Cx(t,ao)  | n(t,oo)] 
and  we  must  instead  consider 
Cx( • ,«) 

as  an  element  of  W.  . We  have  then 
b 

Theorem  1. 


x(t,cj)e  Domain  of  A*  a.e.  0<  t<  T 


A*  x( •,(!)) 


\ \ j 


is  an  element  of  W„  for  each  to,  and  defines  a second  order  random  variable  in  W . 

s s 

Proof.  We  can  clearly  find  a sequence  of  linear  bounded  operators  Tn 
mapping  H$  into  the  domain  of  A*  and  such  that  Tn  converges  to  the  Identity 
strongly.  Hence,  for  each  t,  0 < t <T  : 


k 


I 


! 


; 


E[A*Tnx(t,u)  |n( t) 3 = A*Tn^(t,cj) 

Moreover,  we  have,  for  each  h ( • ) in  W$ 

E[A*(Tn-T)x(-,u),h]2 
< E[A*(Tn-Tm)  x ( • ,oj)  ,h]2 
Since  for  any  f cn  Ws, 

T t 

E( [x,f](x-x, f 3 ) - E (/  [x(t,w) , f ( t ) ] f Cx(s.«)  - x (s,oj), 
o o' 

T s 

E(  f [x(s,oj)  - x (s,oi),  f ( s ) ] J [x(t,co),  f(t)]dt  ds)=  0 

oJ  0 

But  the  random  variables  [A*Tnx(* ,co) ,h]  converge  to  [A*x(*,ui),h 
of  order  two.  Hence 

[A*Tnx(-,ai),h] 

converges  in  the  mean  of  order  two.  But  we  can  write 
[A*Tnx(-,aj),h]  = (o),Lnh] 
and  E[A*Tnx(-,oj),h]2  = || Ln h 1 1 2 
Hence 


Hence 


[ojjl^h]  converges  for  each  w 


[A*-^  x(  • ,u>)  ,h]  converges  for  each  oj  to  [Lu,h]. 
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f(s)]dsdt)+ 


in  the  mean 


j 


But  since  A*  is  closed,  it  follows  that 
£(t,oo)  e 2>  [A*]  a.e. 
and  of  course 

A*x(  • ,oo)  = Lui  e Wg . 

and  defines  a second-order  random  variable.  Hence 
<#(•,«)  = D*  A*x(-,co) 

defines  a second  order  random  variable  in  W.  also. 

b 

Again  let  denote  the  operator  such  that 
x(  •,(*))  =^y(*,o)) 

where  5” is  of  course  linear  bounded,  as  can  be  verified  from  (3.3).  Similarly  let 
x(*  ,<o)  = Leo 

Then 

A*$(  • ,<o)  = A*^(Gai+L(j) 

Given  any  element  h in  we  can  find  to  such  that 
G u)  + Loo  = h 

Indeed  we  have  only  to  take 
to  = G*h 
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Hence  .g’maps  Wb  into  the  domain  of  A*  and  hence  A* 5? and  hence  also  Z2  are 
both  linear  bounded. 

Next  we  shall  show  that 


z(*,ui)  = y(*,w)  - Cx(* ,(i)) 


is  white  noise  in  W^.  But  this  can  be  proved  as  in  [1]  using  the  fact  that 
T 4 

E(  / [x(t,u>)  - Cx(t,w),h(t)]  / [y(s,(j)  - C$(s,a>),h(s)]ds  dt)  = 0 


E([x(t,w)  - x(t,w).h]  [x(t,w)  - x(t,w),g])  = [P(t)h,g] 

Our  object  is  to  show  that  P(T-t)  satisfies  (2.10). 

For  this  let  us  note  that 


y(*,cj)  = (I  - C 2T  z(*»u) 


x(- ,a>)  = 2 (I-C2)"1  z(*,to) 
(I  - CS*(l/n)^_1  - I - Mn 


where 


M is  Vol terra  and  so  is5?(I-M  ), 
n n 


^(I  - converges  strongly  to.2?(I  ~ C2)  . Hence  it  follows  that 

x(*,oj)  = limi/Kl  - z x(t,w)  = E[x(t,a))|  £(t,u>)] 


- F(tH(t,io) 


S(t,w)  = z(s,<j)  0 < s < t 


as  an  element  of  Wb ( t ) . Also 


E[  x(t,w)  5(t,u)*]  = F(t) 


But  for  h ( • ) in  Wb> 


E([x(t,u)),x]  f [z(o,a)),h(a)]da) 


= E[(x(.t,ai) ,x]  f [Cx(a,u)  - C$(a,a>),  h(a)]da 


Now 


E([  x(t,aj),x]  [ x(a,ai)  - $(a,aj),  y ) ] ) 


“ E[(S(t,a)x(a ,u>),x]  [x(cr,aj)  - x(o,ioj,y] 


[P(a)S(t-a)x,y] 


and  hence  (3.4) 


= / [CP(a)S(t-o)x,h(a)]da 


where 


P(a)S(t-a)x  e Domain  of  A*  a.e.  0 < a < t 


and 


/ | |A*P(a)S(t-a)x| 1 2da 


< “ 


and  hence  also 


t 

f HD*A*P(a)S(t-a)xll2da  < » 

But  (3.5) 


« Cx,F(t)h(-)] 


and 


where 

\ 


Hence 


|[x,F(t)h]|2  < [R(t,t)x,x]  f | |h(a) | |2do 

o 

t 

R(t,t)x  = E(x(t,u>)x(t,w)*]x  = / S*(a)FF*S(  o)xda 

o 


Hence 


l|F(t)*x||2  < [R(t,t)x,x] ^ t| I x 1 1 2 1 | FF* 1 1 M( t ) , [M(t)  = Sup  ||S(a)| 

<_  a < t) 

t 1 

( / ||CP(a)S(t-c)x||do)  < t2||x||2||FF*||M(t) 


Ifeing  this,  we  shall  show  now  that  for  x in  the  domain  of  A, 
P(a)x  e 2i(A*)  a.e. 


Let  X be  sufficiently  large  and  positive  so  that  it  is  in  the  resolvent  set  of 
A.  Then:  letting 


Cn  - CS*(l/n) 
we  note  that 
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/ e'xt  f (C  -Cj  P(o)  S(t-o)xda  dt 

0 J n m 

00 

= J e“Xa(Cn-Cfn))P(a)R(X,A)xda 

where  R(A,A)  is  the  resolvent  of  A.  Moreover 

00 

/ e_Xa| |(Cn-Cm)P(a)  R( A,A)x| |da 
00  t 

5 / e'Xt  / ll(Cn-Cm)  P(o)  S(t-a)  x ||<k>  dt 
0 0 

But  ' 

1 KCn”Cm>  P(a)  S^“a)xN 

= | |D*(S*(l/n)  - S*(l/m»A*P(a)S(t-a)x|| 
and 

||CnP(o)  S(t-o)x| | < | | D* | | | |A*P(cr)S(t-a)x| |.  (Constant) 

Hence  the  right  side  of  (3.6)  is  Cauchy. 

But 

e"X°S*(l/n)P(a)  S(t-a)x 

e“Xcr  P(cr)S(t-a)x 

and 

||e‘Xi|S*(l/n)P(o)S(t-o)x|| 

< | |e"XaP(a)S(t-0)x| |.  (Constant) 

17 


(3.6) 


■■■■ 


which  is  integrate  on  (0,®).  Hence  it  follows  (since  A*  is  closed)  that 


P(o)  R( A,A)x  e 2)(A*)  a.e. 

Hence  for  x in  the  domain  of  A, 

P(a)x  e Q)(A*)  a.e., 

and 

t 

f | |A*P(a)x| |da  < ® 

'v  0 

for  each  L > 0.‘ 

In  particular,  since  the  semigroup  is  analytic,  for  every  t > 0 
S(t)x  e domain  of  A*P(o)  a.e. 

Next  we  can  readily  see  that 

9 t ? 

E[x(t,w) ,xy  = f JfCP(a)S(t-a)x| pda 

o 

Hence 

t 

[P(t)x,y]  = f [S(a)*Q*Q  S(a)x,y]da 
0 t 

-/  [CP(a)S(t-a)x,  CP(a)S(t-a)y]da 

oJ 

For  x,y  in  the  domain  of  A,  it  follows  readily  that,  a.e.: 

[P(t)x,y]  = [P(t)Ax,y]  + [P(t)x,Ay] 

+ [Qx.Qy]  - [CP(t)x,  CP(t)y] 

More  P(T-t)  then  satisfies  (2.10)  as  required. 
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